Recognize tone languages using pitch information on the main vowel of each syllable

نویسندگان

  • C. Julian Chen
  • Haiping Li
  • Li Qin Shen
  • Guokang Fu
چکیده

An innovative method for speech recognition of tone languages is reported. By definition, the tone of a syllable is determined by the pitch contour of the entire syllable. We propose that the pitch information on the main vowel of a syllable is sufficient to determine the tone of that syllable. Therefore, to recognize tone languages, only main vowels are needed to associate with tones. The number of basic phonetic units required to recognize tone languages is greatly reduced. We then report experimental results on Cantonese and Mandarin. In both cases, using the main vowel method, while the number of phonemes and the quantity of training data are substantially reduced, the decoding accuracy is improved over other methods. Possible applications of the new method to other tone languages, including Thai, Vietnamese, Japanese, Swedish, and Norwegian are discussed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pitch Accent and Vowel Devoicing in Japanese

Japanese is widely recognized as a prototypical pitch-accent language, based on the fact that, given the “accent” location or the lack thereof, the tonal pattern of the entire word is totally predictable. Therefore, unlike tone languages, specification of the tone of each syllable is unnecessary. Consequently, it has been argued that, although Japanese may superficially resemble tone languages,...

متن کامل

Pitch Processing in Music and Speech

INTRODUCTION A highly-debated question is to what extent music and language share processing components. Beyond syntax and temporal structure processing, one studied aspect is pitchprocessing in a given domain and across domains (e.g., [1]). Pitch processing is crucial in music. For example, in Western tonal music, it is a form-bearing dimension (next to temporal structures). Pitch processing i...

متن کامل

The Prosody of Nigerian English

Nigerian English is a variety of English which has often been suggested to differ significantly from other varieties of English, especially in the area of prosody. This paper analyses the prosody of Nigerian English and compares it to the prosody of British English and three West African tone languages. Read and semi-spontaneous speech was analysed acoustically. Significant differences were fou...

متن کامل

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

Preservation of lexical tones in singing in a tone language

Lexical tones are important for expressing meaning and usually have high priority in tone languages. This can create conflicts with sentence intonation in spoken language and with melodic templates in singing since all of these are transmitted by pitch. The main question in this investigation is whether a language (in our case the Mon-Khmer language Kammu) with a simple two-tone system uses sim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001